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Street : 
City : Athens 
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Application Project 

<120> Title : Arabitol or Ribitol As Positive Selectable Markers 

<13 0> AppFileRef erence : UGA-855R 

<14 0> CurrentAppNumber : 

<141> CurrentFilingDate : - - 



Sequence 

<213> OrganismName : Escherichia coli 
<400> PreSequenceString : 

atgaacgaac aatttacatg gctgcacatc gggttaggtt cttttcatcg cgcacatcag 
gcgtggtatc tacaccgttt gcaggtgatg ggcgataaac gctggagcat tgctgcgggc 
aatattcgta atgatgctga acatgtcgta caggcactca gtgcacagaa aggtcgctat 
gtgctggaaa ccgtcagccc ggaaggggta agcgaatatg aagagatcac ctcaattcag 
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aagttgatac cgtggcaggc agatttacaa ccgctgattg ctgaaggggc agatccgaag 3 00 

acaaaagtga ttgctttcac cgtcaccgaa ggcgggtact acctgaatac cagtcacaaa 360 

ctggaagtta acaatcctga tttagcggca gatcttaaag ggggatgcaa aacaatttac 420 

ggtgttatta cccgtatcct cgaagcgcgt atggcaaata acgccggacc actaaccctg 4 80 

atgaacgaac aatttacatg gctgcacatc gggttaggtt cttttcatcg cgcacatcag 54 0 

gcgtggtatc tacaccgttt gcaggtgatg ggcgataaac gctggagcat tgctgcgggc 6 00 

aatattcgta atgatgctga acatgtcgta caggcactca gtgcacagaa aggtcgctat 660 

gtgctggaaa ccgtcagccc ggaaggggta agcgaatatg aagagatcac ctcaattcag 72 0 

aagttgatac cgtggcaggc agatttacaa ccgctgattg ctgaaggggc agatccgaag 780 

acaaaagtga ttgctttcac cgtcaccgaa ggcgggtact acctgaatac cagtcacaaa 840 

ctggaagtta acaatcctga tttagcggca gatcttaaag ggggatgcaa aacaatttac 900 

ggtgttatta cccgtatcct cgaagcgcgt atggcaaata acgccggacc actaaccctg 960 

ctgaattgcg ataacgtgcg ccataatggt gaacgtttcc atgatggcct ggttgagttt 1020 

ctccagctaa ctggcaaaca ggatgtcatc gactggctga gtacaaatac cacttgcccg 1080 

aataccatgg ttgaccgcat tacgcctcgt ccggcagcag aacttccggc acggatcaag 114 0 

gctcaaacgg gtattgccga taaagcgccg gtaatgggcg aaacctttat ccagtgggtc 1200 

gtggaagata atttccgtga tgtccgtccg gcactggaga aggtcggtgt cgaactggtg 1260 

gcgtcggtaa tcccctatga agaggcgaag attcgcattc ttaactcttc acacagttgc 1320 

atcgcctggg caggtacgtt aatcggtcaa aaatatatcc acgaaagcac aatgaccgat 13 80 

tttatctatc agattgccga ccgctacgtg acggaagatg tcattccttg cttgggcgat 1440 

aacggtatcg atttgccaac ctaccgtgat gttgtactca agcgttttac caatccacat 1500 

attcaggaca ccaaccaacg cgtcgctgcg gatggtttct cgaaaattcc ggcgatgatt 1560 

gcccccacac tgcgagagtg ctaccagcga ggcgttcgcc cgaatgccac cgccatgtta 162 0 

cctgcactgt tttacgtatt catggagcag tggcatcacg gcaaactgcc ctatgaatat 1680 

caggatggca tccttgatgc accagctgtc catgcaatgt tacagtctgc cgatcccgtc 1740 

gctgtttatg ccagtgataa agcgctgttt ggcgatttaa ccgaacgtga agattttgcc 1800 

gcgttgttgc gcgaaaaaat cgctgacgtc tacgcgttaa ttaactaa 1848 
<212> Type : DNA 
<211> Length : 1848 
SequenceName : SEQ ID NO 1 

SequenceDescription : arabitol dehydrogenase 
Custom Codon 

Sequence Name : SEQ ID NO 1 
Sequence 



<213> OrganismName : Escherichia coli 
<4 00> PreSequenceString : 

atcgattgag cagtttgctt cacacggcag ctaaattccc gttcagtgcg tgcaagcaaa 60 

cagagactat aaattcgccc tggtaaaagg attatatgat gaatcactct gtgccctcta 120 

tgaatactcc ccttaatggc aaagttgcag ctatcactgg cgctgcgtca ggtattggcc 180 

tgcaatgtgc aaaaacgctg ctcgatgcag gagcaaaggt agtattgatt gaccgggaag 240 

gcgacaaact gcacaagatt gtcgctgagt taggcgaaaa cgcgtacgcg ctgcaactcg 3 00 

atctcttcaa taatcagcaa gtcgataaca tgctggcgga cattatcgaa ctggcgggtg 360 

ggctggatat ttttcatgcc aatgcaggcg cttatattgg cggcccagtg gctgaaggtg 420 

atccagatgt ctgggatcgt gtgttaaatc tgaatataaa tgcggcgttt cgctgtgtcc 480 

gtgcagtcct gccgcatatg attgcgcaga ggtcgggcga tataattttt accagttcca 540 

tcgcgggcgt cgttccggtt atctgggaac cgatctacac cgcgtccaaa tttgccgttc 600 

aggcattcgt acacactacc cgccgccagg tttctcaata tggcgtgcgt gtgggtgcgg 660 

tgctgccagg accagtagtc actgccctgc ttgatgactg gccaaaagcc aaaatggaag 72 0 

aagccctggc aaatggtagc ctgatgcaac cgattgaagt ggcggaatca gtattgttta 7 80 

tggtgacccg ctcgaaaaat gtcaccgtgc gagatttagt gatcctgcct ggcagtgtcg 840 

atctgtaagg gcgcaatcat gacaataacc aaaaccgtta ttggtgttga tgtgggatca 900 

ggcagtgtcc gcgccgggat ttttgatctc aacggatctc tgctatccca tgccacagaa 960 

aaaatcacga ctacgcggcg cagcggaagc cgcgtggaac agtccagcca ggagatctgg 1020 

caggcggtct gttcatgtat tcgtaatgcg ctcactctgg cagacgtttg tgcacaaagt 1080 

gtggcaggca tcggttttga tgccacctgt tctctggtgg tactggataa aaacggtgat 1140 

ccattgcctg tcagcccgga aggagatgca aagcaaaata tcattgtgtg gatggatcac 12 0 0 

cgcgccaccg aacaagcaga gcgaatcaat gccactcacc atccggtgct gaactacgtc 1260 

ggtggtaaaa tttcgcctga aatggaaaca ccgaaaattc tctggctgaa agaaaatatg 1320 

ccagagatct acgaacgtgc cggacaattt ttcgatctgg ccgattttct gacctggcgg 13 80 

gctaccggtg atttagcgcg ttcagtatgc actgttacct gtaaatggac gtggctggca 1440 
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catgaaaatc gctgggatcc agattatttc cgcaccattg gccttgcaga gttagcggat 
gaagatttta ttcgcattgg tcatcatatt gtttctcccg gaacaccttg tggaaatggt 
ttaacagcac aagccgcggc agagatggga ttactccccg gcacacctgt cgctgtaggg 
ttaattgatg ctcacgctgg tggcatcggt acggttggcg tagaaggtgg agcgctgaac 
aatctcgcgt atgttttcgg cacttcttca tgcaccatgg catctaccac ttctccctcg 
tttgtaccgg gtgtctgggg gccgtattac agtgcgatgg ttccaggtct gtggttagtt 
gaaggcgggc aaagtgctgc gggagcagct attgaccagc tacttgattt ccatccggct 
gttgaagaag ctcgcgaaat ggcacaacgt gtgaatcagc ccctccccgt ctggcttgct 
gatcgaatcc tcgaaaaaac ggcgcaacca tcagatgctg tcgccctggc gaaagggcta 
cacgtggtgc cggaatttct cggaaatcgc gcccccttcg cagatcctca tgccagagcg 
gtaatttgtg gcctgggtat ggagcgagat ctggataatt tactcgcctt gtatatcgct 
ggattatgcg gaattggtta tggtctgcgc caaattctcg acgctcaaac agcgcaggga 
gtagtgagta aaaatatcgt tattagcggc ggtgccgggc agcatccact ggtacgacaa 
attctggcag atacctgcgg tattccggtc attaccacgc aatgctgcga accggtttta 
ttaggctcgg ccattcttgg tgctgtcgcc ggaaatattg caccttctgt tggcgaagcg 
atgcaacaat tcacccatgt ggataaatat tattatccgc aagaacgcta tcaatctctt 
catcatcgtc gatatgaggc ctataagcag ttgcagcata ctgcaaaatt actcagagac 
taattaacca gccgcctgac gctgttttca ggcaatcact aataacgact cactccggta 
atatcccgga gtgcattcat ctgcacccta aaaacgaggt ttatatgtcc agaaataata 
aacagtggtt gggtttgcca ctgcatctga tatggggata tatcgccatc gcagtattta 
tgactggtga tggtttcgaa ctcgcattct tatcgcacta tattaaatcg ttaggcttca 
cacctgcgga agcctctttt gcctttacgc tctacggcct ggcggctgcc ctttccgcct 
gggtttctgg ggtagtagcg gaaatcatca cgccgcaaaa aaccatgctg attggttttg 
tcctatggtg cgtattccat gttctgtttc tggtctttgg attaggacag gcaaactatg 
ggttaatcct gctgttttac gggattcgtg gtttagcgta tccgctattt ctctactcat 
ttatcgtcgt tattattcat aacgtgcgta gcgaaaattc cagttctgct ctgggttggt 
actgggcggt atattcagta gggatcggtg ttgctggcag ttatattccc agttttacga 
taccgattat gggtgaaatg ggaaccttat ggctggcact ggcgttctgc tttgctggcg 
gtgtcattgc catgatctcc ttgcgtcatg ttaaaacgcc tggacatatg cataatttaa 
ctccccgtga gaagtttgca gaattaagtc gggcagtaac tttactttat accaaccgca 
atatttttct ctccagtatt gtgcgcatta taaatacctt atcgttattc ggttttgcgg 
tcattatgcc aatgatgttt gtggatgaac tgggattcac cacctctgaa tggttgcagg 
tctgggcggc atttttcttt accactattt tctctaatat tttttggggg attgtggcag 
aaaaaatggg ctggatgcgt gttattcgct ggtttggttg cctcgggatg gcagcatcaa 
gtttagcgtt ttactacatg ccgcaatact ttggtcacaa ctactggatg gcaatgattc 
cggcgattgc tctgggaact tttgttgctg catttgtgcc gatggccgct gtcttcccgg 
cactggaacc aaaacacaaa ggtgctgcaa tctcggttta caacctctct gcgggtatgt 
ctaacttcct ggctccggca attgccgtgg tgttattacc gtggtttagc actatcggtg 
tggtcattgc ctatacagca ttgtatctat tggcctttgt cctttgcgca ttcattcgcg 
ttgagcagcc aggattcagt tctgcgccag tgactgagaa ggcattgaat atctcctgaa 
aaacgaaacg catcaggcac tcatcctctc cctcatggga gaggatgatt tcacatcagg 
caatagtgac tttgttatcg agataaacgt cctgcacggc gttaatcagt ttcacgccgt 
cagccatcga t 
<212> Type : DNA 
<211> Length : 3971 
SequenceName : SEQ ID NO 2 
SequenceDescription : ribitol operon 

Feature 



Sequence: SEQ ID NO 2 : 

<221> FeatureKey : misc_feature 

<222> LocationFrom : 96 

<222> LocationTo : 848 

Other Information : ribitol dehydrogenase coding region 
CDS Jo in : No 

Feature 



Sequence: SEQ ID NO 2: 
<221> FeatureKey : misc_feature 
<222> LocationFrom : 859 
<222> LocationTo : 2463 

Other Information : ribitol kinase coding region 
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CDSJoin : No 



Feature 

Sequence: SEQ ID NO 2: 
<221> FeatureKey : misc_f eature 
<222> LocationFrom : 2565 
<222> LocationTo : 3839 

Other Information : ribitol transporter coding region 
CDSJoin : No 



Custom Codon 



Sequence Name : ribitol operon 



Sequence 

<213> OrganismName : Escherichia coli 
<400> PreSequenceString : 

MMNHSVPSMN TPLNGKVAAI TGAASGIGLQ CAKTLLDAGA KWLIDREGD KLHKIVAELG 6 0 

ENAYALQLDL FNNQQVDNML ADIIELAGGL DIFHANAGAY IGGPVAEGDP DVWDRVLNLN 12 0 

INAAFRCVRA VLPHMIAQRS GDIIFTSSIA GWPVIWEPI YTASKFAVQA FVHTTRRQVS 180 
QYGVRVGAVL PGPWTALLD DWPKAKMEEA LANGSLMQPI EVAESVLFMV TRSKNVTVRD 240 
LVILPGSVDL 
<212> Type : PRT 
<211> Length : 250 
SequenceName : SEQ ID NO 3 

SequenceDescription : ribitol dehydrogenase 
Sequence 



<213> OrganismName : Escherichia coli 
<4 00> PreSequenceString : 

MTITKTVIGV DVGSGSVRAG IFDLNGSLLS HATEKITTTR RSGSRVEQSS QEIWQAVCSC 60 

IRNALTLADV CAQSVAGIGF DATCSLWLD KNGDPLPVSP EGDAKQNIIV WMDHRATEQA 120 

ERINATHHPV LNYVGGKISP EMETPKILWL KENMPEIYER AGQFFDLADF LTWRATGDLA 180 

RSVCTVTCKW TWLAHENRWD PDYFRTIGLA ELADEDFIRI GHHIVSPGTP CGNGLTAQAA 24 0 

AEMGLLPGTP VAVGLIDAHA GGIGTVGVEG GALNNLAYVF GTSSCTMAST TSPSFVPGVW 300 

GPYYSAMVPG LWLVEGGQSA AGAAIDQLLD FHPAVEEARE MAQRVNQPLP VWLADRILEK 360 

TAQPSDAVAL AKGLHWPEF LGNRAPFADP HARAVICGLG MERDLDNLLA LYIAGLCGIG 420 

YGLRQILDAQ TAQGWSKNI VISGGAGQHP LVRQILADTC GIPVITTQCC EPVLLGSAIL 480 

GAVAGNIAPS VGEAMQQFTH VDKYYYPQSR YQSLHHRRYE AYKQLQHTAK LLRD 534 
<212> Type : PRT 
<211> Length : 534 
SequenceName : SEQ ID NO 4 
SequenceDescription : ribitol kinase 

Sequence 



<213> OrganismName : Escherichia coli 
<400> PreSequenceString : 

MSRNNKQWLG LPLHLIWGYI AIAVFMTGDG FELAFLSHYI KSLGFTPAEA SFAFTLYGLA 60 
AALSAWVSGV VAEIITPQKT MLIGFVLWCV FHVLFLVFGL GQANYGLILL FYGIRGLAYP 120 
LFLYSFIWI IHNVRSENSS SALGWYWAVY SVGIGVAGSY IPSFTIPIMG EMGTLWLALA 180 
FCFAGGVIAM ISLRHVKTPG HMHNLTPREK FAELSRAVTL LYTNRNIFLS SIVRIINTLS 240 
LFGFAVIMPM MFVDELGFTT SEWLQWAAF FFTTIFSNIF WGIVAEKMGW MRVIRWFGCL 300 
GMAASSLAFY YMPQYFGHNY WMAMIPAIAL GTFVAAFVPM AAVFPALEPK HKGAAISVYN 360 
LSAGMSNFLA PAIAWLLPW FSTIGWIAY TALYLLAFVL CAFIRVEQPG FSSAPVTEKA 420 
LNIS 

<212> Type : PRT 
<211> Length : 424 
SequenceName : SEQ ID NO 5 
SequenceDescription : ribitol transporter 



424 



